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SUMMARY 


In  estimating  and  comparing  the  rates  of  change  of  a  continuous  variable 
between  two  groups,  the  unweighted  averages  of  individual  simple  least  squares 
estimates  from  each  group  are  often  used.  Under  a  linear  random  effects  model, 
when  all  individuals  have  complete  observations  at  identical  time  points  these 
statistics  are  maximum  likelihood  estimates  for  the  expected  rates  of  change. 
However,  with  censored  or  missing  data,  these  estimates  are  no  longer  efficient 
when  compared  to  generalized  least  squares  estimates.  When,  in  addition,  the 
right  censoring  process  is  dependent  upon  the  individual  rates  of  change  (i.e., 
informative  right  censoring),  the  generalized  least  squares  estimates  will  be 
biased.  Likelihood  ratio  tests  for  informativeness  of  the  censoring  process  and 
maximum  likelihood  estimates  for  the  expected  rates  of  change  and  the  parameters 
of  the  right  censoring  process  are  developed  under  a  linear  random  effect  models 
with  a  probit  model  for  the  right  censoring  process.  In  realistic  situations,  we 
Illustrate  that  the  bias  in  estimating  group  rate  of  change  and  the  reduction  of 
power  in  comparing  group  difference  could  be  substantial  when  strong  dependency 
of  the  right  censoring  process  on  individual  rates  of  change  is  ignored. 

Some  Key  Words'-  Informative  right  censoring;  Linear  random  effect;  Probit  right 
censoring;  Rate  of  change. 
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SECTION  1  :  Introduction 

In  clinical  trials  and  longitudinal  studies  it  is  often  of  interest  to 
estimate  and  compare  the  rates  of  change  of  one  or  more  variables  between  groups, 
in  e.g.,  lung  function  or  tumor  growth.  Furthermore,  comparing  the  rates  of 
change  of  a  continuous  response  variable  between  two  treatment  groups  is  often 
the  primary  objective.  Death  or  withdrawal  may  cause  some  observations  of  the 
primary  variable  to  be  right  censored. 

Growth  curve  methods  for  comparing  rates  of  change  have  been  studied 
extensively,  see  Rao  (1965),  Fearn  (1975)  and  Schlesselman  (1973).  Most  of  these 
analyses  assume  that  there  are  no  right  censored  or  missing  observations. 
Maximum  likelihood  and  generalized  weighted  least  squares  provide  alternative 
approaches  to  simple  least  squares  for  the  analysis  of  series  measurements  when 
some  observations  are  right  censored  or  missing.  Koziol,  et  i^l .  (1981)  proposed 
a  distribution-free  test  for  the  comparison  of  growth  curves  with  incomplete 
data.  In  order  to  be  valid,  these  procedures  require  that  the  probabilities  of 
right  censoring  or  missing  do  not  depend  on  the  parameter  values  of  the  response 
under  investigation,  i.e.,  they  are  non-informative  with  respect  to  the  response 
parameters. 

In  this  paper  we  are  primarily  interested  in  right  censoring  caused  by  the 
participant’s  death  or  withdrawal,  to  be  referred  to  as  the  primary  right 
censoring  process.  The  primary  right  censoring  process  could  be  informative  with 
respect  to  the  response  parameters.  In  our  development,  staggered  entry  and 
other  missing  value  processes,  if  incorporated,  are  assumed  to  be  non-informative 
and  Independent  of  the  primary  right  censoring  process. 


Under  a  linear  random  effects  model,  we  propose  a  model  which  can  depend 
both  on  the  Individual’s  initial  value  and  slope.  A  likelihood  ratio  test  for 
informativeness  and  maximum  likelihood  estimates  for  the  response  parameters  and 
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the  primary  right  censoring  process  coefficients  are  derived  under  a  probit  model 
for  the  probability  of  primary  right  censoring. 

The  right  censoring  is  considered  to  be  non- informative  with  respect  to  the 
response  parameters  if  the  likelihood  function  can  be  factored  into  two 
independent  parts,  one  corresponding  to  the  response  parameter  and  the  other 
corresponding  to  censoring  parameters. 

We  show  that  when  the  primary  right  censoring  is  non-informative,  the 
maximum  likelihood  estimates  for  the  average  linear  regression  coefficients  of 
the  response  are  weighted  linear  combinations  of  the  simple  least  squares 
estimates.  In  the  case  of  complete  observations  at  identical  time  points  among 
all  individuals,  these  estimates  are  just  the  unweighted  averages  of  the 
individual  simple  least  squares  estimates. 

The  proposed  method  is  applied  to  data  on  patients  with  PiZ  phenotype, 
gathered  by  the  workshop  on  Natural  History  of  PiZ  Emphysema  (1983).  To 
illustrate  the  effect  of  informative  right  censoring,  maximum  likelihood  and  the 
weighted  and  unweighted  least  squares  procedures  are  applied  to  a  set  of 
simulated  clinical  trials  with  primary  right  censoring  generated  from  a 
non-informative  probit  process  and  then  to  another  set  of  simulated  trials  with 
primary  right  censoring  generated  from  an  informative  probit  process.  Mean 
squared  error  and  power  comparisons  are  made  among  the  different  statistical 
procedures  and  between  these  two  sets. 


SECTION  2  :  Linear  Random  Effects  and  Informative  Right  Censoring 

We  assume  that  the  participants  of  a  longitudinal  study  are  divided  into  two 
treatment  groups  of  sample  sizes  n^.  for  k  =  1,2.  The  combined  sample  size  is  n 
=  n^+ng-  Let  there  be  J  identical  mortality  (and  withdrawal  status)  follow-up 
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time  points,  tj.  with  t^  =  0  and  tj  =  the  length  of  the  study.  Each  participant 
can  have  at  most  R  measurements  of  the  response  during  the  study.  The 
measurement  time  need  not  be  identical  among  individuals.  Let  =  total  number 
of  measurements  made  for  the  ith  individual.  Let  and  tJu  be  the  uth  response 
and  the  corresponding  measurement  time  for  the  ith  participant  in  the  combined 
sample  for  u=l,2 . and  i=l,2 . n.  With  ^=0,  let  t^  1  tj  if  death. 

withdrawal,  or  right  censoring  due  to  staggered  entry  occurred  for  the  ith 
participant  between  time  tj  and  tj+1;  otherwise  t^  =  tj  if  the  ith  participant 

was  not  right  censored  and  t^y  =  t.^,  =  tj  if  the  ith  participant  had  complete 

observations. 

It  is  assumed  that  the  serial  measurements  of  the  primary  variable  follow  a 
linear  function  of  time.  Let  ^  t*ie  unobservable  vector 

representing  the  true  initial  value  and  slope  of  the  primary  variable  for  the  ith 
individual  in  the  combined  sample.  For  i  €  k  and  k  =  1,2; 


li  =  Xi  £i  +  ti’  where  'li  ~  (Yn‘  •  •  Ylu  > 


(2.1) 


it  ~  N(5k’ V  ^  ti  ~  "e  X> 


=  1  .  1  1  ,  .  \ 

^11 . tiuj  P  o 


y  "Pl22  •Bk2)' 
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The  notation,  i  €  k,  is  used  to  denote  that  the  ith  participant  in  the  combined 

sample  belonged  to  the  kth  treatment  group. 

We  further  suppose  that  the  probability  of  being  primarily  right  censored 

due  to  death  or  withdrawal  during  a  specified  time  interval  (0,t  ),  given  p.,  is 

J 

M(aCp^ ,  tj).  Here  aC  =  (a^.a^)  is  the  vector  of  "regression  parameters"  relating 


it  ■  if.  4S  l>.  I*.  •».  !<.  a»> 


>V><>  s\  tVi«.  |i_  luiVi'ili’*  ii. 


1*4  ***  4». 


this  probability  to  the  primary  variables  p..  Examples  of  logical  choices  for  M 

«v* 

are  proportional  hazards  regression  (Cox  1972),  logistic  regression  (Walker  and 
Duncan  (1967))  and  probit  regression  (Halperin,  Wu  and  Gordon  (1979)).  For 
instance,  under  probit  regression  M(a(p,  t.)  =  $(0^/3  +  an ) ,  where  <$>  is  the 

A«  J  »V  VJ 

cuinnulative  probability  of  a  standard  normal  variate. 

a 

Since  for  each  P . .  (i)  the  simple  least  squares  estimates  P  = 

»v»  <v* 

(X^CXj)  ^(X^Y^)  (ii)  censoring  time  and  ( i i i )  survival  time  are  sufficient 
statistics  for  p..  it  suffices  to  consider  the  joint  distribution  of  p.,  P  and 

rV  1  X  X 

the  primary  right  censoring  process.  The  marginal  likelihood  for  B,  and  a  for 
the  ith  individual  can  be  expressed  as 


Li  ■  D  J  *Ai-  ?i-  ci,)VSi-5k-V  j2  «'-«J-1)C(1-J'1) 

(Mj  -  M^j)^1-1"15}  [l-MJ](1"m(i))d£i, 


(2.2) 


where  Mj  =  M(a’/3^,tj)  for  i  €  k,  k=l,2,  j=l,...,J;  =  0.  Here  C(i,j)  is  the 

indicator  function  that  the  ith  individual  was  censored  in  the  jth  interval 
because  of  staggered  entry,  Z(i.J)  is  the  indicator  function  that  death  or 
withdrawal  occurred  in  the  jth  interval  for  the  ith  individual,  D  is  constant 
with  respect  to  p. ,  a  and  B,  ,  and 

'V 1  r\*  'V** 


Cli  =  °l  V  *•  m(1)  =  2  +  • 

J 

The  notation  ^(Y,  p,  2)  represents  the  bivariate  normal  density  with  mean  vector 

«  <V  <M 

p  and  covariance  matrix  2.  C**  the  right  hand-side  of  equation  (2.2),  under  the 

integration  sign,  the  first  factor  represents  the  conditional  probability 


vstawam mns 
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distribution  of  0.  given  0.  ,  C(i,j)  and  Z(i,j)  for  j=l . J-l.  The  second 


factor  is  the  probability  distribution  of  0..  The  third  factor  of  products 

>v* 


corresponds  to  the  conditional  probabilities  that  the  ith  participant  survived 


the  (j-l)th  time  point  and  then  was  censored  by  staggered  entry  or  death  (or 


withdrawal)  between  the  (j-l)th  and  the  jth  time  points,  respectively  for 


j=2,...,J  given  0..  The  last  factor  represents  the  conditional  probability  that 


the  ith  participant  survived  the  entire  study,  given  0..  Therefore  the  product 


of  these  four  factors  is  proportional  to  the  joint  distribution  of  0..  0..  Z(i.j) 


and  C(i,j),  because  the  staggered  entry  process  and  the  missing  value  process  are 


assumed  to  be  non- informative  and  independent  of  the  primary  right  censoring 


process.  Hence,  integration  with  respect  to  the  vector  0  provides  the  marginal 

/x/  * 


likelihood  of  0.,  Z(i,J)  and  C(i.j)  with  respect  to  B,  and  a. 


This  notation  can  be  used  for  those  measured  only  at  baseline.  Equating  all 


elements  except  the  (l,l)th  of  and  1^  to  zero  and  letting  the  (1.1) th  element 


of  C.  .  equal  a  and  0  „  =  0  „  =  0.  The  marginal  likelihood  for  all  n 


individuals  is  the  product  of  the  individual  likelihoods. 


Joint  estimation  of  the  parameters  depends  on  the  ability  to  evaluate  (2.2) 


and  its  derivatives.  For  this  section  we  assume  that  I0  and  a  are  known.  The 

0  e 


more  realistic  case  will  be  discussed  in  the  next  two  sections.  In  principle 


(2.2)  can  be  evaluated  by  numerical  integration.  When  the  primary  right 


censoring  process  is  a  probit  model,  (2.2)  can  be  evaluated  explicitly:  for  i  €  k 


and  k  =  1,2, 


t„  -1, 


ln{Lj>  =  ln(D)  +  lnfA^  -  0.5^  -  (Pik  ~  j^)  +  ^ 


(2.3) 


where 


Ai  =  1^21 1 2 )  .  C2l  =  Cli  +  I„. 


the  generalized  least  squares  estimate  (GLSE) .  When  all  individuals  have 


complete  observations  measured  at  identical  time  points.  Si  will  be  the  same 
among  individuals,  in  which  case  (3.1)  reduces  to 


£uw,k  =  ,^k  £i  7  V 


the  unweighted  least  squares  estimate  (UWLE).  The  covariance  matrices  are, 


CGL.k  *  [(|k  C21  ']  SjW.k  -  [  C2*  . 


2  C2i 

i£k  K 


(3.2) 


When  2„  and  a  are  unknown,  the  following  unbiased  estimators  can  be 
P  ^ 


substituted, 


2  2  . 
=  s  / 
e 


2  (i»  -  2)1. 

4=1  1  J 


2^  —  Sp  /(n  1)  —  2  C^/n  , 


(3.3) 


2  n 

k=i  ick  l~~l '  W(2‘ '  w‘  “d  s/  *  xi£i>  • 

A 

However,  2^  has  the  disadvantage  that  it  is  not  necessarily  positive  definite. 
The  procedure  given  by  Bock  and  Peterson  (1975)  for  constructing  an  estimate  that 
is  at  least  semi-definite  will  be  used. 

When  the  goal  of  a  study  is  to  compare  differences  in  rate  of  change  between 
two  groups,  we  wish  to  test  the  null  hypothesis,  against  the 

alternative  hypothesis,  :  B^  <  ®22‘  test  stat*st*c  *s  of  the  form. 


(£,2  -  SjjVi 2  ♦  . 


12  22 


with  Bj^  =  and  By^  ^  respectively.  For  shifted  alternatives,  sample 


size,  power  and  significance  level  of  the  test  can  be  related  according  to  the 
approximate  formula. 


k  2  +  2kz„  +  z«)2  =  a2  . 


B12  B22  a  P 


(3.4) 


where  A  is  the  difference  in  expected  rates  of  change  we  wish  to  detect,  a  and  p 
are  the  Type  I  and  Type  II  errors  of  the  test  with  and  the  unit  normal 
deviates  corresponding  to  a  and  p. 


REMARKS  :  We  have  by  assumption  that  an  individual’s  coefficient  estimate  is 
unbiased,  i.e., 

e[3,  i  p,  ]-/>,. 

Thus  the  unweighted  least  squares  estimate  is  unbiased  for  B^.  There  are  two 
cases.  When  the  primary  right  censoring  is  non-informative,  the  distribution  of 
(Dgj  in  (3.1)  does  not  depend  on  Pi ,  so  that  the  GLSE  and  UWLE  are  both  consistent 
and  unbiased  estimators  of  R  ,  although  of  course  the  UWLE  is  less  efficient. 

Furthermore,  the  relative  differences  between  the  variances  and  hence  the 

required  sample  sizes  of  the  UWLE  and  the  GLSE  for  the  slope  or  initial  value  are 

2  2  2  2 

a  function  of  ( a  /aR  )  or  (a  /aR  )  respectively.  When  the  primary  right 

e  p g  fc  p  j 

censoring  process  is  informative,  the  unweighted  least  squares  estimate  is  still 
unbiased,  although  the  GLSE  is  not  because  and  P ^  are  dependent. 


SECTION  4  : 


>les  and  Simulations 


This  paper  was  motivated  by  design  and  analysis  problems  encountered  in  many 
clinical  trials  concerning  lung  diseases,  e.g.,  the  Intermittent  positive 


pressure  breathing  trial  (IPPB  1983)  for  chronic  pulmonary  diseases. 


One 
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specific  example  was  the  feasibility  study  of  an  anti-proteolytic  replacement 
therapy  trial  among  individuals  with  PiZ  phenotype,  conducted  by  the  Workshop  on 
the  Natural  History  of  PiZ  emphysema.  The  association  between  severe  alpha^  - 
antitrypsin  deficiency  and  lung  diseases,  particularly  pulmonary  emphysema,  has 
been  observed  since  the  early  1960's  (Laurell  and  Eriksson  (1963)).  Individuals 
with  PiZ  phenotype  tend  to  develop  severe  alpha^  antitrypsin  deficiency  and  hence 
pulmonary  emphysema  and  more  rapid  decline  in  lung  function.  The  planned  trial 
was  designed  to  detect  differences  in  rates  of  decline  of  a  one  second  forced 
expiratory  volume  (FEV^)  between  a  control  and  a  therapeutic  group. 
Retrospective  data  on  PiZ  individuals  were  gathered  from  the  ten  participating 
institutions  (see  Workshop  on  Natural  History  of  PiZ  Emphysema  (1983))  to  provide 
crude  estimates  of  parameter  values  required  for  sample  size  calculations. 


4.1  :  Estimation  and  Testing 


A  Fortran  program  was  developed  for  estimation  when  there  is  no  staggered 

entry.  The  method  of  pseudo  maximum  likelihood  estimation  (PMLE  -  see  Gong  and 

2 

Samaniego  1981)  was  used.  Estimates  of  a  and  were  made  according  to  (3.3) 

£  P 

and  the  Bock  and  Peterson  (1975)  procedure  and  substituted  into  (2.3),  which  was 

then  maximized  by  the  Newton-Raphson  method.  The  algorithm  first  calculates  the 

2 

simple  least  squares  intercept  and  slope  for  each  individual  and  estimates  a 
and  The  UWLE  of  B^  is  used  as  initial  value  for  B^  in  calculating  d^  and 

for  each  individual  according  to  (2.3).  Partial  derivatives  of  the  log 
likelihood  for  the  Newton-Raphson  iterative  procedure  are  then  calculated  using 
initial  values  for  a^,  ag.  ao2’  ••••  aoj’  Formulae  for  these  partial  derivatives 
are  presented  in  the  Appendix.  Note  that  the  initial  values  for  the  a’s  can  be 
chosen  arbitrarily  with  the  constraint  <  ...  <  a^j. 

This  algorithm  was  applied  to  the  PiZ  emphysema  data. 


Among  the  data 


gathered  for  294  PiZ  individuals  by  the  ten  U.S.  institutions,  initial  and 
follow-up  FEVj  values  (with  the  initial  and  the  last  measurements  at  least  6 
months  apart)  were  available  on  117  individuals.  The  number  of  FEVj  measurements 
ranged  from  2  to  12  (mean  number  of  measurements  =  3.8).  The  duration  between 
the  initial  and  the  last  measurement  ranged  from  6  to  227  months  (mean  duration  = 
52  months).  Since  the  proposed  trial  duration  was  between  three  and  six  years, 
an  analysis,  corresponding  to  a  three  year  follow-up  study,  was  first  made  using 
the  initial  and  all  follow-up  FEV^  measurements  made  within  three  years  of  the 
initial  measurement.  Since  many  did  not  have  reported  follow-up  FEV^’s  within 
three  years  of  the  initial  measurement,  only  81  individuals  with  8  deaths  were 
included  in  this  analysis.  A  second  analysis,  corresponding  to  a  six  year 
follow-up  study,  was  also  made  among  those  with  a  minimum  follow-up  of  six  years 
or  a  reported  death  within  the  first  six  years.  Follow-up  FEV^’s  within  six 
years  of  the  initial  measurement  were  used.  This  analysis  included  65 
individuals  with  19  deaths.  Because  of  the  small  number  of  deaths,  mortality 
follow-ups  were  grouped  into  two  equal  length  intervals  for  both  analyses.  The 
average  number  of  FEV^  measurements  were  2.9  and  3.6  and  the  average  duration 
between  the  initial  and  the  last  FEV^’s  were  28  and  48  months  for  the  3  and  6 
year  follow-ups.  respectively.  Those  individuals  with  only  one  FEV^  measurement 
were  not  included  in  these  analyses.  This  has  the  effect  of  causing  a  slight 
bias  in  the  unweighted  least  squares  analysis  and  a  slight  loss  of  efficiency  in 
the  informative  censoring  analysis. 

hmhmmmhhwkmwxxwxw  Table  1  about  here  xmhxxhxxxxxxxkw 
The  purpose  of  these  analyses  was  to  test  for  informativeness  of  the  right 
censoring  caused  by  participant’s  death  with  respect  to  FEV^  initial  value  and 
slope,  obtained  from  3  and  6  yee.r  follow-ups,  respectively;  and  to  derive  crude 
estimates  of  the  primary  right  censoring  coefficients.  The  initial  values  used 


for  the  iterative  procedure  were  =  -1.35,  =  -0.90  and 


The 


algorithm  converged  after  12  and  10  iterations  for  the  first  and  second  analyses, 
respectively.  The  results  are  presented  in  Table  1.  The  estimated  probit  right 
censoring  coefficients  for  FEVj  initial  value  and  slope  (ctj  and  a were  -3.8, 
-11.3  and  -4.6,  -13.8  for  the  two  analyses,  respectively.  Likelihood  ratio  tests 
Indicated  that  the  coefficients  for  FEVj  initial  value  (dj)  were  statistically 
significantly  different  from  zero  in  both  analyses.  Although  the  chi-squared 
statistic  (with  one  degree  of  freedom)  of  2.8  for  the  slope  coefficient  ( a of 
the  first  analysis  was  not  statistically  significant  at  a  5%  level,  the 
chi-squared  statistic  of  7.1  for  the  slope  coefficient  of  the  second  analysis  was 
statistically  significant.  The  significance  of  the  initial  value  coefficients 
and  the  large  negative  slope  coefficients  obtained  from  both  analyses,  the 
significance  of  the  slope  coefficient  from  the  second  analysis  indicated  that  the 
right  censoring  by  participants'  deaths  could  be  informative  with  respect  to  both 
FEVj  initial  value  and  slope. 

Survival  probability  distributions  estimated  by  the  product  limit  method 
(Kaplan  and  Meier,  1958),  for  the  entire  data  set  of  294  individuals,  for  those 
individuals  included  in  the  first  and  second  analyses  of  Table  1,  respectively; 
and  for  the  117  individuals  with  two  or  more  FEV^  measurements  are  displayed  in 
Figure  1.  Since  these  data  were  collected  retrospectively,  mortality  follow-ups 
were  not  as  complete  and  rigorous  as  one  would  like  them  to  be  for  the  proposed 
prospective  study.  Hence,  survival  probabilities  in  Figure  1  could  be 
optimistic. 

The  estimates  we  have  proposed  are  of  course  sensitive  to  model 
misspecif ication.  When  using  the  estimation  and  test  procedures  derived  under 
the  probit  model,  goodness-of-f i t  to  the  data  should  be  checked.  One  approach  is 
to  note  that  the  estimated  probability  for  the  ith  individual  being  primarily 

A  A  A 

right  censored  in  the  Jth  time  interval,  for  given  f3. ,  is  P.  .  =  <J)(U.  .  .)  - 

A/*  ^  J  1 

.  A  * 

q^Ujj),  for  J=1 .... ,  J-l ;  where  Ujj  is  U^j  of  (2.3)  with  a  and  and  the 


expected  intercept  and  slope  for  the  primary  variable  being  replaced  by  their 
MLE’s.  Therefore,  group  the  4*(Ujj)  into  groups  and  compute  for  each  group,  = 

/N 

SjPjj.  Then  compare  Ej  with  the  observed  deaths  and  dropouts  between  the  jth  and 
(j+l)th  time  points. 

For  the  PiZ  six  year  follow-up  data  of  Table  1,  the  observed  number  of 
deaths  among  those  whose  estimated  probabilities  of  death  in  six  years  were  above 
the  85th  percentile,  between  the  70th  and  85th  percentiles  and  below  the  70th 
percentile  (for  the  entire  65  individuals)  were  4,  2,  2  and  3,  4,  4  for  the  first 
and  second  three  year  intervals,  respectively.  The  corresponding  expected 
numbers  of  death  were  4.47,  1.82,  0.85  and  2.98,  3.70,  3.88  for  the  two  time 

intervals,  respectively.  Hence,  the  probit  model  seems  to  fit  the  data 
reasonably  well. 

Graphical  comparison  of  the  actual  versus  expected  cumulative  numbers  of 
death  by  the  estimated  probability  of  death  in  six  years  for  the  six  year 
follow-up  data  is  displayed  in  Figure  2A.  Comparisons  of  the  actual  versus 
expected  cumulative  numbers  of  death  in  the  first  and  second  three  year  intervals 
by  the  estimated  probability  of  death  in  the  corresponding  time  intervals,  for 
the  same  six  year  follow-up  data,  are  shown  in  Figure  2B.  The  overall  fits  of 
the  data  from  both  figures  were  again  reasonably  good. 

4,2  :  The  effect  of  informative  censoring 

The  UWLE,  GLSE,  and  the  PMLE  were  compared  in  simulated  experiments  based  on 
the  model  (2.2)  with  the  following  primary  right  censoring  processes:  (1) 
probit  non-informative  censoring  with  aj  =  a2  =  (2)  probit  informative 

censoring  with  coefficients  =  -3.8,  a ^  =  11.3;  (3)  probit  informative 

censoring  with  coefficients  =  -4.6,  ag  =  -13.8,  corresponding  to  the  two 

analyses  of  Section  4.1;  and  (4)  probit  informative  censoring  with  =  -3.8  and 
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a2  =  0.  Similar  to  the  IPPB  trial  (1983),  the  study  duration  was  assumed  to  be 

three  years  with  four  FEV^  measurements  per  year.  The  expected  FEV^  slope  and 

initial  value  in  the  control  group  and  the  within  and  between  individual 

variances  used  were  all  estimated  from  the  PiZ  data.  A  50%  reduction  in  FEV^ 

rate  of  decline  was  assumed  in  the  treatment  group.  Equal  sample  sizes  of  100 

each  were  generated  for  the  two  groups.  In  the  IPPB  trial,  similar  to  the 

proposed  trial,  patients  were  required  to  have  their  FEV^  values  less  than  65% 

predicted  at  entry  and  the  comparison  of  FEV^  annual  rates  of  decline  between  two 

randomized  treatment  groups  was  the  primary  objective  of  the  trial.  The  primary 

right  censoring  rate  for  the  IPPB  trial  was  more  than  12%  per  year.  For  these 

illustrations  the  probability  of  primary  right  censoring  was  assumed  to  be  16% 

each  year  for  all  individuals  under  the  non-informative  right  censoring  process. 

When  the  informative  probit  model  was  used,  this  probability  was  assumed  to  be 

16%  for  an  individual  whose  initial  value  and  slope  were  equal  to  the  expected 

values  for  the  control  group.  It  was  further  assumed  that  there  is  no 

correlation  between  the  slope  and  the  initial  value  (crR  R  =0).  The  decision 

P1P2 

A 

value  used  for  rejecting  the  null  hypothesis  of  no  difference  was  (B^  “ 
i 

~  2  a  o 

B oo)/(CTn  +  an  )2  <  -1.645.  Normal  random  numbers  were  generated  by  the  IMSL 

22  12  “22 

routine  GNPM.  The  experiments  were  repeated  600  times. 

The  results  in  Table  2  indicate  that  the  UWLE  procedure  remained  relatively 
unbiased  in  estimating  the  mean  FEV^  slope  for  each  group  and  the  between  group 
difference  in  slopes.  However,  the  PMLE  clearly  had  much  smaller  mean  squared 
errors  in  estimating  the  individual  group  mean  slopes  and  the  between  group 
differences,  and  much  higher  statistical  power  in  detecting  the  between  group 
differences  in  all  four  censoring  processes  considered.  The  GLSE,  although  most 
efficient  under  non-informative  censoring,  resulted  in  large  under-estimations  of 
individual  group  mean  FEV.  rates  of  decline  (24-46%),  under  the  two  probit 


informative  censoring  processes  with  non-zero  coefficients  for  FEV^  slope.  The 
under-estimation  for  the  between  group  differences  were  much  smaller  (11-13%), 
because  under  the  shifted  alternative  of  equation  (3.4),  the  biases  in  the  two 
group  estimates  tend  to  cancel  with  each  other.  The  GLSE  had  smaller  mean 
squared  errors  in  estimating  the  between  group  differences  and  higher  statistical 
power  to  detect  these  differences  than  the  ITWLE  in  all  four  censoring  processes 
considered.  Compared  to  the  PMLE,  under  the  two  probit  informative  censoring 
processes  with  non-zero  slope  coefficients,  the  GLSE  had  much  larger  mean  squared 
errors  (39-69%)  in  estimating  the  individual  group  mean  slopes  and  (14-22%)  in 
estimating  the  between  group  differences;  and  lower  statistical  power  (10-15%)  in 
detecting  the  between  group  differences.  The  expected  power  for  the  proposed 
study,  calculated  according  to  (3.4)  using  the  assumed  parameter  values,  was  0.85 
for  the  GLSE  under  the  non-informative  censoring  process.  The  simulated  power 
for  the  GLSE  under  non-informative  censoring,  using  the  estimated  within  and 
between  individual  variances  according  to  (3.3)  and  the  Bock  and  Peterson  (1975) 
procedure  for  constructing  covariance  matrices  that  were  at  least  semi-definite 
was  0.81,  and  not  very  different  from  the  expected  power.  Using  the  PMLE  when 
the  censoring  process  was  non-informative  or  when  the  probit  censoring  slope 
coefficient  was  zero  could  result  in  larger  mean  squared  errors  than  the  GLSE.  in 
estimating  the  group  slopes.  The  simulated  significance  levels  were  not  much 
different  from  the  expected  5%  level  for  all  procedures  in  Table  2. 


Table  2  about  here 


SECT  I  Off  5  :  Discussion 


The  probit  model  used  in  Sections  2  and  4  is  not  necessarily  meant  to  be 
biologically  valid  for  describing  the  underlying  right  censoring  process. 
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Indeed,  Che  choice  of  the  probit  was  made  primarily  on  computational  grounds,  and 
because  logistic  and  problt  regressions  give  similar  estimates  of  event 
probabilities  (Halperin,  et  al.  (1979)). 

When  using  the  estimation  and  test  procedures  derived  under  the  probit 
model,  goodness-of-f it  to  the  data  should  be  checked  as  suggested  in  Section  4.1. 
However  the  distribution  of  the  chi-squared  goodness-of-f i t  test  statistic  for 
this  situation  cannot  be  obtained  from  a  straightforward  application  of  the  usual 
theory  because  (i)  parameter  estimates  are  determined  using  likelihood  functions 
for  ungrouped  data;  and  { i i )  random  cell  boundaries.  Moore  (1971)  and  Moore  and 
Spruill  (1975)  derived  large  sample  distribution  of  the  usual  chi-squared 
goodness-of-f it  statistics  under  these  two  problems.  Their  basic  result  is  that 
under  appropriate  regularity  conditions  the  large  sample  distribution  of  the 
goodness-of-f it  statistic  is  that  of  a  central  chi-squared  with  the  usual 
reduction  in  degree  of  freedom  due  to  estimated  parameters  plus  a  weighted  sum  of 
independent  chi-squared  random  variables  each  with  one  degree  of  freedom. 
Application  of  their  result  to  this  problem  is  under  investigation. 

Although  the  estimation  and  test  procedures  of  Sections  2  and  4  were 
developed  for  k=2  groups,  they  could  be  extended  easily  to  the  case  of  k  >  2 
groups.  To  test  for  equality  or  linear  trend  among  the  expected  slopes  of  the 
k>2  groups,  the  likelihood  ratio  chi-squared  statistic  could  be  used. 

The  standard  errors  provided  in  Table  1  for  estimates  based  on  the  probit 
right  censoring  model  and  those  used  in  computing  the  test  statistics  for  the 
PMLE  in  Table  2  were  estimated  from  the  sample  information  matrix  based  on  the 


pseudo  maximum  likelihood,  by  assuming  that  the  estimated  between  and  within 
individual  error  variances  were  the  true  values,  rather  than  based  on  the  maximum 


likelihood.  The  bootstrap  (Efron,  1979)  could  be  used  to  improve  these 


estimates.  Alternatively,  the  maximum  likelihood  procedure,  treating  ofe  . 

2 

and  an  as  additional  parameters,  could  also  be  used. 
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ik  =  (dikl’dik2*'  pi  =  a2il2  7  ^a2il  *  CT2i2^’  *ij  =  ^ij  =  for 
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J=2 . J  *u  =  4>tl  =  0.  From  (2.3),  =  (aoj  +  ai  ^ikl  +  a2  dJk2)  /  D2. 

2  2  2  2 

for  j=2 . J:  where  D  =  (1  +  a^iX  ctj  +  CT3n2ala2  +  CT3i2  a2  ^ '  The  Paraineter  to 

be  estimated  is  0C  =  (0j . 0j+5)  =  (Bn ,B12’B21 ,B22’al ,a2,a02 . a0J^'  The 

partial  derivatives  of  the  log  likelihood  (2.3)  with  respect  to  these  parameters 
are  as  follows: 
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=  0 


j  =2 . J  ; 


where  C  =  or  a  +  ct_.i0  a_  for  7  =  1.2; 
t  3i7  7  3il2  3-7 
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CT3i£ 
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CT3i  12 

for 
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Table  1 

Estimation  for  the  Expected  FEV^  Slope,  the  Missing  Value  Coefficient 
and  Likelihood  Ratio  Tests  Statistics  for  the  Coefficients 


Estimates  and 
Test  Statistics 


Three  Year  Mortality 

Three  Year  FEV^ 
Follow-Up 


Six  Year  Mortality 

Six  Year  FEVj 
Follow-Up 


>*  Estimated  FEV^  change/vear 

V, 


VC 

Unweighted 

-0.093 

( .0164)* 

-0.078 

(0.0138)* 

i 

Weighted 

-0.090 

(.0151) 

-0.076 

(0.0136) 

K  . 

k/ 

k 

,  *■ 

Probit  Informative  Missing 

-0.095 

(.0152) 

-0.085 

(0.0133) 

k>  * 

k./ 

Estimated  Missine  Value  Coefficients 

5; 

1* 

FEVj  Initial  Value 

-3.80 

(2.01) 

-4.61 

(1.70) 

£ 

tv  ■ 

V 

FEVj  Slope 

-11.30 

(7.46) 

-13.80 

(6.76) 

• 

°02 

-0.53 

(1.10) 

1.25 

(0.93) 

’/ 

V. 

a03 

0.42 

(1.10) 

2.42 

(1.05) 

V. 

1 

L.R.  Test  Statistics 

$ 

A 

Initial  Value,  vs. 

H0(X21) 

11.02 

26.97 

\ 

1 

“  * 

Ji 

Slope,  H2  vs.  H^X2!) 

2.81 

7. 13 

y. 

a 

No.  at  risk  at  baseline 

81 

65 

4  ■ 

No.  of  Deaths 

8 

19 

L 

/j 

Numbers  in  parentheses  are 

estimated 

standard 

errors . 

f 


s 

« 


s 


Informative  Censoring 


Statistical 
Procedures  & 
Treatment 
Groups 

Non-informative 

Censoring 

al  =  a2  =  0 

a1=~3-8 

a2=-11.3 

a 

a1=-4.6 

2 =-13 . 8 

( 

a1=3.8 

x2=-0.0 

FEV 

FEV 

FEV 

1 

FEV 

Slope 

MSE 

Slope 

MSE 

Slope 

MSE 

Slope 

MSE 

Control  Group 

UWLE 

-89.3 

465.4 

-88.9 

645.7 

-88.3 

719.4 

-89.2 

570.4 

GLE 

-90.3 

156.3 

-68.3 

634.9 

-63.7 

883.3 

.90.7 

164.6 

PMLE 

-89.4 

-83.6 

455.0 

-81.2 

597.8 

-90.2 

210.6 

Treatment  GrouD 

UWLE 

-46.5 

442.3 

-45.9 

444.9 

-46.2 

489.8 

-46.9 

444.7 

GLE 

-45.9 

148.2 

-28.2 

423.1 

-24.4 

576.0 

-46.3 

150.7 

PMLE 

-44.6 

194.4 

-39.7 

288.4 

-38.8 

340.6 

-44.3 

162.9 

Between  Group  Differences 

UWLE 

-42.8 

935.8 

-43.0 

1059.3 

-42.11 

1210.9 

-42.3 

1052.0 

GLE 

-44.5 

361.8 

-40.1 

339.0 

-39.3 

377.8 

-44.4 

313.0 

PMLE 

-44.8 

267.4 

-43.9 

297.1 

-42.4 

309.0 

-44.  1 

297.3 

Simulated  Power  &  Sienificance  Level 

Power 

Sienif 

Power 

Sienif 

Power 

Sienif 

Power 

Sieni 1 

UWLE 

0.45 

0.05 

0.39 

0.05 

0.36 

0.05 

0.40 

0.05 

GLE(0.85) 

0.81 

0.06 

0.72 

0.07 

0.67 

0.07 

0.77 

0.06 

PMLE 

0.80 

0.05 

0.80 

0.06 

0.79 

0.06 

0.78 

0.07 

The  parameter  values  used  were:  Measurement  error  standard  deviation  =  0.155 

FEV.  initial  value  standard  deviation  =  0.39L.,  FEV.  slope  std.  dev. 

P1  1  P2 
0.091L/yr.,  a ^  =  0.091L/yr.t  ^  =  0,  expected  FEV^  initial  value  =  B9^ 

0.96L.  control  and  treatment  group  expected  FEV^  slopes  B^  =  -0. 09L/yr.  and  B ^ 

-0.045/yr.  For  significance  level,  -  B?<?  =  -,09L/yr.  The  probability  of  missing 
16%/yr .  and  n^  =  =  100. 

Expected  power  under  non-informative  missing  process,  calculated  according  to  (3.4 
using  the  actual  parameter  values. 
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